Visualising Search Result Sets Using a Force-Based Method to Form Clusters of Similar Documents
Abstract
As human knowledge increases, so does the volume of electronically available information. Finding specific information becomes more difficult, and ever more matches are returned in response to a search query. Since quantity is seldom quality, numerous approaches to making sense of search result sets have been proposed. This thesis describes SearchVis, an approach to visualising search result sets that builds on the method Matthew Chalmers described in his 1996 paper 'A Linear Iterative Layout Algorithm for Visualising High-Dimensional Data'. The visualisation concentrates on the similarities between the retrieved documents. An animated, force-based technique produces clusters of similar documents: similar documents attract one another and dissimilar documents repel. SearchVis allows the user to adjust the visual discrimination of the clusters through several parameters. It was tested with a variety of test data sets over a wide range of parameter settings. In order to reach as wide an audience as possible, SearchVis was written in Java.
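The force-based idea can be illustrated with a minimal Java sketch. It is not the SearchVis code and not Chalmers' linear-time algorithm: it is a naive spring model that compares every pair of documents in each iteration. The class name `ForceLayoutSketch`, the methods `layout` and `distance`, the target distance `1 - similarity`, and all parameter values are assumptions made for illustration only.

```java
import java.util.Random;

// Hypothetical sketch of a force-based layout; not taken from SearchVis.
final class ForceLayoutSketch {

    // Returns n x 2 positions after a fixed number of naive all-pairs iterations.
    // similarity[i][j] is assumed to be symmetric and to lie in [0, 1].
    static double[][] layout(double[][] similarity, int iterations, double stepSize, long seed) {
        int n = similarity.length;
        Random random = new Random(seed);
        double[][] pos = new double[n][2];
        for (double[] p : pos) {            // random start inside the unit square
            p[0] = random.nextDouble();
            p[1] = random.nextDouble();
        }
        for (int it = 0; it < iterations; it++) {
            double[][] force = new double[n][2];
            for (int i = 0; i < n; i++) {
                for (int j = i + 1; j < n; j++) {
                    double dx = pos[j][0] - pos[i][0];
                    double dy = pos[j][1] - pos[i][1];
                    double dist = Math.max(Math.hypot(dx, dy), 1e-9);
                    // Assumed spring rest length: similar documents want to sit close together.
                    double target = 1.0 - similarity[i][j];
                    // Positive: the pair is too far apart and attracts; negative: it repels.
                    double pull = (dist - target) / dist;
                    force[i][0] += pull * dx;
                    force[i][1] += pull * dy;
                    force[j][0] -= pull * dx;
                    force[j][1] -= pull * dy;
                }
            }
            for (int i = 0; i < n; i++) {   // apply all forces at once
                pos[i][0] += stepSize * force[i][0];
                pos[i][1] += stepSize * force[i][1];
            }
        }
        return pos;
    }

    static double distance(double[] a, double[] b) {
        return Math.hypot(a[0] - b[0], a[1] - b[1]);
    }

    public static void main(String[] args) {
        // Documents 0 and 1 are near-duplicates; document 2 is unrelated to both.
        double[][] similarity = {
            {1.0, 0.9, 0.1},
            {0.9, 1.0, 0.1},
            {0.1, 0.1, 1.0}
        };
        double[][] pos = layout(similarity, 1000, 0.1, 42L);
        System.out.printf("d(0,1)=%.3f d(0,2)=%.3f%n",
                distance(pos[0], pos[1]), distance(pos[0], pos[2]));
    }
}
```

In this sketch the step size and the mapping from similarity to target distance play the role of adjustable parameters: a steeper mapping should separate clusters more sharply, at the cost of a less stable iteration. With larger document sets, spring layouts of this kind can settle in a local minimum, so the result may depend on the starting positions.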
Similar resources
An Investigation of the Role of Homograph Context Types in Determining Similarity between Documents
Aim: Automatic information retrieval is based on the assumption that texts contain content or structural elements that can be used in word sense disambiguation, thereby improving the effectiveness of the results retrieved. Homographs are among the words requiring sense disambiguation. Depending on their roles and positions in texts, homograph contexts can be divided into different types, wit...
Comparison of Strategic Plans of Universities and Institutes of Higher Education with a Quantitative Approach
Strategic plans in Iranian universities and institutes of higher education are generally prepared using strategic planning models introduced by experts and other universities. These plans are published as university strategic planning documents. The documents have features that may be similar to or different from the planning templates used. Existence of the similar...
Text Summarization Using Cuckoo Search Optimization Algorithm
Today, with the rapid growth of the World Wide Web and the creation of Internet sites and online text resources, text summarization receives considerable attention from researchers. Extractive text summarization is an important summarization method that consists of selecting the top representative sentences from the input document. When we are facing large data volume documents, the extr...
Coronavirus: Discover the Structure of Global Knowledge, Hidden Patterns & Emerging Events
Background & Objective: The present study aimed at exploring the structure of global knowledge, hidden patterns, and emerging Coronavirus events using co-word techniques. Co-word analysis is one of the most efficient scientific methods to analyze the structure and dynamics of knowledge and the general state of research. Materials & Methods: This applied research performed using Co-word anal...
Clustering multilingual documents by estimating text-to-text semantic relatedness
This thesis is about multilingual document clustering through estimating semantic relatedness between multilingual texts. Specifically, we focus on the task of clustering multilingual documents with very limited or no supervisory information. We present two approaches to address the problem: a comparable-corpora based approach and a web-searches based approach. Our first approach derives pairwi...
Publication date: 1997